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(54) Title: HUMAN GENES AND GENE EXPRESSION PRODUCTS II 
(57) Abstract 

This invention relates to novel human polynucleotides and variants thereof, their encoded polypeptides and variants ^^^^ 
corresponding to these polynucleotides and to proteins expressed by the genes. The invention also relates to diagnoshc and thempeuuc 
agents employing such novel human polynucleotides, their corresponding genes or gene products, e.g., these genes and proteins, including 
probes, an ti sense constructs, and antibodies. 
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large numbers of oligodeoxyribonucleotides (oligos)) is described. The method is derived 
from DNA shuffling (Stemmer, Nature (1994) 370:389-391), and does not rely on DNA 
ligase, but instead relies on DNA polymerase to build increasingly longer DNA fragments 
during the assembly process. 
5 Appropriate polynucleotide constructs axe purified using standard recombinant 

DNA techniques as described in, for example, Sambrook et aL 9 Molecular Cloning: A 
Laboratory Manual 2nd Ed, (1989) Cold Spring Harbor Press, Cold Spring Harbor, NY, 
and under current regulations described in United States Dept. of HHS, National Institute 
of Health (NiH) Guidelines for Recombinant DNA Research. The gene product encoded 

10 by a polynucleotide of the invention is expressed in any expression system, including, for 
example, bacterial, yeast, insect, amphibian and mammalian systems. Suitable vectors and 
host cells are described in U.S. Patent No. 5,654,173. 

Bacteria. Expression systems in bacteria include those described in Chang et aL, 
Nature (1978) 275:615; Goeddel et al., Nature (1979) 281:544; Goeddel et a!., Nucleic 

1 5 Acids Res. (1980) 5:4057; EP 0 036,776; U.S. Patent No. 4,551,433; DeBoer et aL Proc. 
Natl Acad Set (USA) (1983) 80:21-25; and Siebenlist et al, Cell (1980) 20:269. 

Yeast. Expression systems in yeast include those described in Hinnen et aL, Proc. 
Natl Acad, ScL (USA) (1978) 75:1929; Ito et aL, J. Bacteriol (1983) 755:163; Kurtz et aL, 
Mol Cell Biol (1986) 6:142; Kunze etal.,J. Basic Microbiol (1985) 25:141; Gleeson et 

20 aL, J, Gen. Microbiol. (1986) 752:3459; Roggenkamp et aL, Mol Gen. Genet (1986) 
202:302; Das et aL, J. Bacteriol (1984) 158:1 1 65; De Louvencourt et aL, J. Bacteriol 
(1983) 154:737; Van den Berg et al 9 Bio/Technology (1990) 5:135; Kunze et al., J. Basic 
Microbiol (1985) 25:141; Cregg et al., Mol Cell Biol (1985) 5:3376; U.S. Patent Nos. 
4,837,148 and 4,929,555; Beach and Nurse, Nature (1981) 300:706; Davidow et aL, Curr. 

25 Genet. (1985) 70:380; Gaillardin etaL, Curr. Genet. (1985) 70:49; Ballance etaL 9 
Biochem. Biophys. Res. Commun. (1983) 7/2:284-289; Tilburn et aL, Gene (1983) 
26:205-221; Yelton et aL, Proc. Natl Acad ScL (USA) (1984) 57:1470-1474; Kelly and 
Hynes, EMBOJ. (1985) ¥:475479; EP 0 244,234; and WO 91/00357. 

Insect Cells. Expression of heterologous genes in insects is accomplished as 

30 described in U.S. Patent No. 4,745,05 1 ; Friesen et aL, "The Regulation of Baculovirus 

Gene Expression", in: The Molecular Biology OfBaculoviruses (1986) (W. Doerfler, ed,); 
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EP 0 127,839; EP 0 155,476; and Vlak et aU J. Gen. Virol. (1988) tfP:765-776; Miller et 
aU Ann. Rev. Microbiol (1988) 42:177; Carbonell et al. y Gene (1988) 73:409; Maeda et 
al y Nature (1985) 5/5:592-594; Lebacq-Verheyden et aU Mol Cell. Biol. (1988) 5:3129; 
Smith et at., Proc. Natl. Acad. Set (USA) (1985) 52:8844; Miyajima et al. 9 Gene (1987) 
5 55:273; and Martin et al., DNA (1988) 7:99. Numerous baculoviral strains and variants and 
corresponding permissive insect host cells from hosts are described in Luckow et ah, 
Bio/Technology (1988) tf:47-55, Miller et aL, Generic Engineering (1986) 5:277-279, and 
Maeda et al. y Nature (1985) 375:592-594. 

Mammalian Cells. Mammalian expression is accomplished as described in 

10 Dijkema et aL 9 EMBO J. (1985) 4:761, Gorman et aL 9 Proc. Natl Acad. Sci. (USA) (1982) 
79:6777, Boshart et aL 7 Cell (1985) 47:521 and U.S. Patent No. 4,399,216. Other features 
of mammalian expression are facilitated as described in Ham and Wallace, Meth. Enz. 
(1979) 55:44, Barnes and Sato,^4«a/. Biochem. (1980) 102:255, U.S. Patent Nos. 
4,767,704, 4,657,866, 4,927,762, 4,560,655, WO 90/103430, WO 87/00195, and U.S. RE 

15 30,985. 

Polynucleotide molecules comprising a polynucleotide sequence provided herein 
propagated by placing the molecule in a vector. Viral and non-viral vectors are used, 
including plasmids. The choice of plasmid will depend on the type of cell in which 
propagation is desired and the purpose of propagation. Certain vectors are useful for 

20 amplifying and making large amounts of the desired DNA sequence. Other vectors are 
suitable for expression in cells in culture. Still other vectors are suitable for transfer and 
expression in cells in a whole animal or person. The choice of appropriate vector is well 
within the skill of the art. Many such vectors are available commercially. The partial or 
full-length polynucleotide is inserted into a vector typically by means of DNA ligase 

25 attachment to a cleaved restriction enzyme site in the vector. Alternatively, the desired 

nucleotide sequence can be inserted by homologous recombination in vivo. Typically this 
is accomplished by attaching regions of homology to the vector on the flanks of the desired 
nucleotide sequence. Regions of homology are added by ligation of oligonucleotides, or by 
polymerase chain reaction using primers comprising both the region of homology and a 

30 portion of the desired nucleotide sequence, for example. 
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skill in the art, including those described in 5,242,974; 5,384,261; 5,405,783; 5,412 087; 
5 424,186; 5,429,807; 5,436,327; 5,445,934; 5,472,672; 5,527,681; 5,529,756; 5,545,531; 
5 554,501; 5,556,752; 5,561,071; 5,599,895; 5,624,71 1; 5,639,603; 5,658,734; WO 
93/17126; WO 95/1 1995; WO 95/35505; EP 742287; and EP 799897. The arrays of the 
5 subject invention find use in a variety of applications, including gene expression analyse, 
drug screening, mutation analysis and the like, as disclosed in the above-listed exemplary 
patent documents. 

In addition to the above nucleic acid libraries, analogous libraries of polypeptides 
are also provided, where the where the polypeptides of the library will represent at least a 
10 portion of the polypeptides encoded by "SEQ ID NOS .1-5252." 

VII. Utilities 

A it~ ofPolvnucle-'^ in MmwniF and in Tissue Profiling 

Polynucleotide probes, generally comprising at least 12 contiguous nucleotides of a 
15 polynucleotide as shown in the Sequence Listing, are used for a variety of purposes, such 
as chromosome mapping of the polynucleotide and detection of transcription levels. 
Additional disclosure about preferred regions of the disclosed polynucleotide sequences xs 
found in the Examples. A probe that hybridizes specifically to a polynucleotide Closed 
herein should provide a detection signal at least 5-, 10-, or 20-fold higher than the 
20 background hybridization provided with other unrelated sequences. 

^ g in section ^ vr ^inn Levels. Nucleotide probes are used to detect 
expression of a gene corresponding to the provided polynucleotide. In Northern blots 
roRNA is separated electrophoretically and contacted with a probe. A probe is detected as 
hybridizing to an mRN A species of a particular size. The amount of hybridization is 
25 quantitated to determine relative amounts of expression, for example under a particular 
condition. Probes are used for in situ hybridization to cells to detect expression. Probes 
can also be used in vivo for diagnostic detection of hybridizing sequences. Probes are 
typically labeled with a radioactive isotope. Other types of detectable labels can be used 
such as chromophores, fluors, and enzyme, Other examples of nucleotide hybridization 
30 assays are described in WO92/02526 and U.S. Patent No. 5,124,246. 
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et al., Methods in Molecular Biology (1997) 68:1, Boultwood, «L, Human Press, Totowa, 
NJ. 

Polynucleotides are mapped to particular chromosomes using, for example, 
radiation hybrids or chromosome-specific hybrid panels. See Leach et al.. Advances in 
Genetics, (1995) 35:63-99; Walter et al., Nature Genetics (1994) 7:22; Walter and 
Goodfellow, Trends in Genetics (1992) 9:352. Panels for radiation hybrid mapping are 
available from Research Genetics, Inc., Huntsville, Alabama, USA. Databases for markers 
using various panels are available via the world wide web at http:/F/shgc- 
www.stanford.edu; and htt p /A^-oenomr wi mit edu/cpi-hin/c^ntip/rhmapper.pl . The 
statistical program RHMAP can be used to construct a map based on the data from 
radiation hybridization with a measure of the relative likelihood of one order versus 
another. RHMAP is available via the world wide web at 
http://www.sph.umich.edu/group/statgen/software. 

In addition, commercial programs are available for identifying regions of 
chromosomes commonly associated with disease, such as cancer. Polynucleotides based 
on the polynucleotides of the invention can be used to probe these regions. For example, if 
through profile searching a provided polynucleotide is identified as corresponding to a gene 
encoding a kinase, its ability to bind to a cancer-related chromosomal region will suggest 
its role as a kinase in one or more stages of tumor cell development/growth. Although 
some experimentation would be required to elucidate the role, the polynucleotide 
constitutes a new material for isolating a specific protein that has potential for developing a 

cancer diagnostic or therapeutic. 

Ticc. Ty ping or Profiling. Expression of specific mRNA corresponding to the 
provided polynucleotides can vary in different cell types and can be tissue-specific. This 
variation of mRNA levels in different cell types can be exploited with nucleic acid probe 
assays to determine tissue types- For example, PCR, branched DN A probe assays, or 
blotting techniques utilizing nucleic acid probes substantially identical or complementary 
to polynucleotides listed in the Sequence Listing can determine the presence or absence of 
the corresponding cDNA or mRNA. 

For example, a metastatic lesion is identified by its developmental organ or tissue 
source by identifying the expression of a particular marker of that organ or tissue. If a 
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polynucleotide is expressed only in a specific tissue type, and a metastatic lesion is found 
to express that polynucleotide, then the developmental source of the lesion has been 
identified. Expression of a particular polynucleotide is assayed by detection of either the 
corresponding mRNA or the protein product. Immunological methods, such as anubody 
staining, are used to detect a particular protein product. Hybridization methods can be used 
to detect particular mRNA species, including but not limited to in situ hybridization and 
Northern blotting. 

TTg ^ f Polymorphisms. A polynucleotide of the invention will be useful in 
forensics, genetic analysis, mapping, and diagnostic applications if the corresponding 
region of a gene is polymorphic in the human population. Particular polymorphic forms of 
the provided polynucleotides can be used to either identify a sample as deriving from a 
suspect or rule out the possibility that the sample derives from the suspect. Any means for 
detecting a polymorphism in a gene are used, including but not limited to electrophoresis of 
protein polymorphic variants, differential sensitivity to restriction enzyme cleavage, and 
hybridization to allele-specific probes. 
B. Antibody Production 

Expression products of a polynucleotide of the invention, the corresponding mRNA 
or cDNA, or the corresponding complete gene are prepared and used for raising antibodies 
for experimental, diagnostic, and therapeutic purposes. For polynucleotides to wmch a 
corresponding gene has not been assigned, this provides an additional method of 
identifying the corresponding gene. The polynucleotide or related cDNA is expressed as 
described above, and antibodies are prepared. These antibodies are specific to an eprtope 
on the polypeptide encoded by the polynucleotide, and can precipitate or bind to the 
corresponding native protein in a cell or tissue preparation or in a cell-free extract of an m 

vitro expression system. 

Immunogens for raising antibodies are prepared by mixing the polypeptides 
encoded by the polynucleotides of the present invention with adjuvants. Alternatively, 
polypeptides are made as fusion proteins to larger immunogenic proteins. Polypeptides are 
also covalently linked to other larger immunogenic proteins, such as keyhole limpet 
hemocyanin. Immunogens are typically administered intradermal*, subcutaneously, or 
intramuscularly. Immunogens are administered to experimental animals such as rabbits, 
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In addition to the antibodies discussed above, genetically engineered antibody 
derivatives are made, such as single chain antibodies, according to methods well known in 
the art. 

C. TI*g nf Polvm..pl«»ntide S to C ™**™* Arrays for Diagnostics 
Polynucleotide arrays provide a high throughput technique that can assay a large 
number of polynucleotide sequences in a sample. This technology can be used as a 
diagnostic and as a tool to test for differential expression to determine function of an 
encoded protein. Arrays can be created by spotting polynucleotide probes onto a substrate 
(e.g., glass, nitrocelllose, etc.) in a two-dimensional matrix or array having bound probes. 
The probes can be bound to the substrate by either covalent bonds or by non-specific 
interactions, such as hydrophobic interactions. Samples of polynucleotides can be 
detectably labeled (e.g., using radioactive or fluorescent labels) and then hybridized to the 
probes. Double stranded polynucleotides, comprising the labeled sample polynucleotides 
bound to probe polynucleotides, can be detected once the unbound portion of the sample is 
washed away. Techniques for constructing arrays and methods of using these arrays are 
described in EP No. 0 799 897; PCT No. WO 97/29212; PCT No. WO 97/27317; EP No. 0 
785 280; PCT No. WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat No. 5,578,832; EP 
No. 0 728 520; U.S. Pat. No. 5,599,695; EP No. 0 721 016; U.S. Pat. No. 5,556,752; PCT 
No. WO 95/22058; and U.S. Pat No. 5,631,734. 

As discussed in some detail above, arrays can be used to examine differential 
expression of genes and can be used to determine gene function. For example, arrays of the 
instant polynucleotide sequences can be used to determine if any of the provided 
polynucleotides are differentially expressed between a test cell and control cell {e.g., cancer 
cells and normal cells). For example, high expression of a particular message in a cancer 
cell, which is not observed in a corresponding normal cell, can indicate a cancer specific 
protein. Exemplary uses of arrays are further described in, for example, Pappalarado et al, 
Sem. Radiation Oncol. (1998) 8:217; and Ramsay Nature Biotechnol. (1998) 75:40. 
D. Differential Expression 

The polynucleotides of the invention can also be used to detect differences in 
expression levels between two cells, as a method to identify abnormal or diseased 
tissue in a human. For polynucleotides corresponding to profiles of protein families, the 
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to, amniotic fluid, chorionic villi, blood, and the blastomcre of an in vitro-fertilized 
embryo. The comparable normal polynucleotide-related gene is obtained from any tissue. 
The mRNA or protein is obtained from a normal tissue of a human in which the 
polynucleotide-related gene is expressed. Differences such as alterations in the nucleotide 
5 sequence or size of the same product of the fetal polynucleotide-related gene or mRNA, or 
alterations in the molecular weight, amino acid sequence, or relative abundance of fetal 
protein, can indicate a germline mutation in the polynucleotide-related gene of the fetus, 
which indicates a genetic predisposition to disease. Particular diagnostic and prognostic 
uses of the disclosed polynucleotides are described in more detail below. 

10 E. Diagnostic. Prognostic, and Other Uses Based On Differential Expression 

In general, diagnostic methods of the invention for involve detection of a level or 
amount of a gene product, particularly a differentially expressed gene product, in a test 
sample obtained from a patient suspected of having or being susceptible to a disease {e.g., 
breast cancer, lung cancer, colon cancer and/or metastatic forms thereof), and comparing 

15 the detected levels to those levels found in normal cells (e.g., cells substantially unaffected 
by cancer) and/or other control cells (e.g., to differentiate a cancerous cell from a cell 
affected by dysplasia). Furthermore, the severity of the disease can be assessed by 
comparing the detected levels of a differentially expressed gene product with those levels 
detected in samples representing the levels of differentially gene product associated with 

20 varying degrees of severity of disease. 

The term "differentially expressed gene" is intended to encompass a polynucleotide 
that can, for example, include an open reading frame encoding a gene product (e.g. , a 
polypeptide), and/or introns of such genes and adjacent 5* and 3* non-coding nucleotide 
sequences involved in the regulation of expression, up to about 20 kb beyond the coding 

25 region, but possibly further in either direction. The gene can be introduced into an 
appropriate vector for extrachromosomal maintenance or for integration into a host 
genome. In general, a difference in expression level associated with a decrease in 
expression level of at least about 25%, usually at least about 50% to 75%, more usually at 
least about 90% or more is indicative of a differentially expressed gene of interest, /.e., a 

30 gene that is underexpressed or down-regulated in the test sample relative to a control 
sample. Furthermore, a difference in expression level associated with an increase in 
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. . t-no/ to 75% more usually at least 

about 90% aid can be at least about 1 Mold. usually « "<=" 

can be about .00-fold to — . ,000-fold n-ease ^ve to a control - 
Lcanve of . dually expressed gene of inte^ I . . - overexpressed or up- 

reading Same encoding a gene Deduct) W urnquely .defies . drfferen y xp 
gene so tha, de*c«on of the —a.., expressed po,ynuc.eot,de - a ^ 

express* polynucleotides" is also mean, to encompass ^ of the enclosed 

Zlogous, subsuntraH, simila, or sub^anuaUy idenuca. (,*, havtng about 90* 
sequence identity) to the disclosed polynucleotides. 

McUodsof the subject invention useful in diagnosis or prognosrs 
^atisonof^^c.of.selec^difle^aUyex^ssedgJ.epn^ma 

exo^onofu-genep^uctwheteAedni^^bemeasupe " 

Lression product deleted in the sample «i* «* •* P»*"* m 5 

express.™ prod ^^^^^ ^ ^ ^..^e,,,,, 

curve. Acompansoncanbemadev^ally.b, . awmresentative library of cDNA 
wjmorvrtmoutcompufcrizedassist^ 

llo f n^A^^.-^^^^ to,he, '^l 

JbJ!f clones of the same gene pmduc, in a control s»np.«; - ^ — ~ " 
^^HybHd^onp^^ofacon.,, 
of different methods for determining the nucleic «c.d abundance .n sample 
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of different methods for determining the nu< 

45 



PCT/US99/01619 

WO 99/38972 

Any of. variay of detectable labels can be used in common with the various 
embodiments of the diagnostic memo* of the invent Suitable detectable M -nclude 
fluorochrornes..;,.*. fluorescein isothiocyana* (FITQ, rbodamine, Texas Red, 
phycoervtiuu.al,ophycoc,.nin.^^ 

dichloro-o^xyfluorescein, o^boxy-X-rhodanuue (ROX), 6*»rboxy-2 ,4 ,7 ,4 7- 
^^fluorescein (HEX), 5-ca*oxynu„rescein (5-FAM) orN^-^-antethyW- 
carboxyrhodaoune (TAMRA)), radioactive labels, (*.,. »S. U - J. - 
de«**abl= label can involve a two stage sy*ems («.*. biotir«vidin, hapten-anh-hapten 

antibody, etc.} . , 

K^ews specific for the polynucleotides and polypeptides of the invention, sucb as 

expression product in a biological samp*. The kit can also contain buffers or labenng 
comp^ as well as instructions for using the reagents to detect and ou^tify express™ 
p^ducts in the biological santple. ExempUtr, embodiments of the diagnostic metttod, of 

the invention are described below in more detail. 

P^^r ^^^^n^ to one embodiment, the test sample >s assayed 
^^■evelofadifferermaUy^opressedpolypeptide. Diagnosis can be accomplished 
.sing any of a number of methods to determine the absence or presence or altenrf amounts 
cfme diffcretmany expressed polypeptide in the test sample. For exa^e, detection can 
unite sunning of cells or histological sections with labeled antibodies, performed m 
ac«^~wi* convention.!™^ Cells can be penneabilized to satin cytoplasmte 

molecule . 
po,ypeptide of the invention are added <o a sample, and incubau* for a period of time 
sufficient to allow binding to the epitope, usually « leas. *>» "> • The antibody 

can be deteaabl, labeled for dim* detection (.e.g., using radioisotopes, enzymes, 
fluoresces chemilurninesccrs, and me like), or can be used in conjunction with, second 
stag, antibody or reagent to dettct binding <*.*, biotin with horseradish peroxrdase- 
conjugated avidin, a secondary antibody conjugated to a fluorescent compound, eg 
fluorescein, rhodantine, Text, red, The abs^co o, presence of antibody bmdmg can 
be deterrnined by various memods,ir^^ 

microscopy, radiography, scintiUation coaming, e,c. Any suitable alternative methods can 
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• • a of levels or amounts of differentially expressed 

of Qualitative or quantitative detection of levels or am 

FT ISA western blot, immunoprecipitation, 
polypeptide can be used, for example ELISA, western 

radioimmunoassay, etc. 

,» general, the <te.ee.ed level of differentially exposed polypepUde n, the test 

ir iSm p 1 e^, in a„o m ^Hoe g ^e^o IU1 .e«n^ ga ^d^ 

„v„,ve^oI^Ae,^b,ageneco^^ 
Nucleotides of .he invention An, — * - «« 

example, * hybridization in tissue sections, by reverse ^ 
Northcmblo^coru^gpolyA+mRNA. ^^^^^t^L 

bXl~ons or dysplastic - -..pared ^ *. expression of the m*KA „ rtf^e 
sjle, a pc*. or negative control sample <*.. — tissue, —us tissu. 

" C ' Any suttee method for deleting and comparins *A expression levels in . 

sample can be used in connection v*h «» diagnostic meti»ds of .he 

U.S 5,804,382). For example, mRNA expression levels in a sample can be 

of a library of expressed seouence „ <BSTs> the sample, ^ £ EST 

;„ «amole (Adams, et al., (1991 ) science 
library is representative of sequences present m the sample lAaams, 

7 . ^- „ ^ncT. within the library can be 

252 1651). Enumeration of the relative representation of ESTs within * 

sample. Th— <«« — -■-^-^""'"i! 

of. referee s-nple » deKnnme me relative expression levels of a select 

polynucleotide, particularly a polynucleotide corresponding to one or more of the 
differentially expressed genes described herein. 

Alternatively, gene expression in a - sample can be performed 
analysis of gene expression (SAGE) methodology (Velculescu e. al.. »_ 0»» 



) 

analysis of gene expression (SAGE) method 
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■ r .. Jr ., rr , The diagnosuc methods of the 

invention can focus on the expression of , single differently expressed gene. For 
rxan^eo.edi^osucmea^cantavo.ve^ 

example, me umbo „u™ in an ccxiinE region or control region), 

polymorrMsmofs U ch.Eene(e.g.,«polyrnorph.smmaneoom g reg 

nl^iaudwith disease. Disease-associared polymorphisms can mdude delator 

„mcauo„ of me gene, mutanons that alter expression leve. and/or aflec, acavty of me 

dually gene can be compared ,o expression .eve!s of me normal allele* - van- 
meutodsxnowninthear, Memods for deUrmining promo- or 

qu^tauon of me expressed natural protein; insernon of me vnrian, ««. element into 
vector with a report gene such as p-galacfosidase, .uciferase, chloramphenicol 
acetyltransferase,*,. that provides for convemem quantitation; and the uxe. 

sp^c sequence, .disease associated polyrnorphisn. Where large amounts of DN A 
gen^c DNA is used dheC, Altemaovely, the region of interest - cloned 
Z a suitabfc L. and grown in sufficient quantity for analysis. Cells to* express a 
miu , ^fwviTJMA which can be assayed 

differerrtiall, expressed gene can be used as a source of mRNA, which 
n^y or reverse transcribed into cDNA for analysis. The nucleic acid c^i he amplified 
by conventional techniques, such as the polymerase chain reaction (PCR), to r-ov.de 
snfflcient.uno.ms .or analysis, and a dete«ab.e label can he incited m me arnp.^on 

MM. d«~uon. The use of the polymerase chain re^don is desenbed ,» Sat*, ■« <■>■> 
Science (1985) 2,9-Ml, and a review of techniques can be found in Sambroox, e, a,.. 
; Mo^a„^. ^^^.09 8 9)pp. »«. Alternate >*-°^ 
.^arelmov^artthattmliie „,i g onuc.e„«id« " ° ™^ j"^"* 

p„,ym.rpl»^, for examples see Rile, - 

Delahuntyera'., .<»■../.»»» . Gwwr. (1996) 58:1239. 

1 sample nucleic acid, amplified or cloned fragment, is ana.yzed by one of a 
0 nnmherofmemodsKnowttmrne^ The nucleic acid can be sequenced by dideoxy or 
othe, memods. and me sequence of bases compared to a selected sequence, to a wild- 
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bre** tissue, ,. discrimina* between breast cancer, « differ™, cells of origin, .o 
discrirnina* between bre«* cancers with different potential metastatic r*es, etc. 

o .,..„ f P» l y n .,cieoti e « th,Per»Mr ^nn l on ml Antagonise 

Polypeptides encoded by the insuun polynucleotides and cc^esponding full lengti, 
5 gen esea,beusedtosc^peptideu,^es.oide^ 
from among the encoded polypeptides. 

A library of peptides can be syntitesized Mowing th. methods disclosed n, U.S. 
Pat No. 5.0.0.175 (M75X and in WO 91,17*23. As described below in brief, one prer-es 

,0 sigraJ ^sanction and receptor binding activity. hfc 1»«**.-*r** 

amino acids. The concen»ati.„ of eacb amino acid in the ration mixtine . balanced or 

equimola, mixture of amino acids coupled to the starting resin. The bound »» nerds are 
,5 utendepro^^dn^etedwi* another bala«*d amino acid mixtirre to f«m an 

e^olarrrdxnneofaUpossibledipcpUde, This process is repe«ed nun,, mature of 

^tee i, is known *. • P^tar amino acid is essential in . given position), thus 
e^ple.d, ft. mixtine of peptides is sc^ed for binding to ft. select polypeptide. The 
the desired activity are then isolated and sequenced. 

The method described in WO 91/17823 is similar. However, ins«*d of re**** the 

equal portions (or into a number of portions corresponding to the number of drfferent 
amino acids to be added in that *ep), and each ^ acid is coupled individuaUy to 
portion of resin. The resin portion, are then combined, mixed, and again dmded rnto a 
numberof equal portions for ^on vriu. to second amino acid. In tins manner, each 
30 reaction can be easily driven to completion. Addition*, one « n^tainsepar** 

■subpools" by seating porti<ms in pandlel, r*her titan combining -1 -sins a. each tap. 
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receptor binding or signal transduction activity. 

In such eases, thesubpools confining, .-2.000 candidal each are exposed to 

then resized as a group ofsnnuler s»bpoo.s (snt-subpools) . «*• 

candidal, and reassayed. Positive sub-subpools can be resynthesized as nuWual 
compounds, and assayed Anally to determine the peptides that exhibit a high btndtng 
constant These peptides can be tes*d for their ability to inhibit - enhance the native 

The mrfhods described in WO 91/7S23 and U.S. PaUntNo. S..94.392 (heretn 
incorporated by reference, enable the preparation of such pools and subpoofc by autotnau* 
techniques in pandle., such t. » synthesis and resynutesis can be performed . a marter 

Peptide agonist or antagonists are screened using any available memod, such as 
signal transduction, antibody binding, receptor binding, mitogen* assays, ehemo«ms 
^ etc Tne memods described herein are presently referred. The assay eondmons 
ideally shot*, resemble me conditions under which the ntfive activity is exhibtted . vm>, 
ti-is.underphysiologiepa^nf^.andionicstrengttv Suitable agomsts or 
.rra^onistsvnll exhibit strong inhibition or enlumcementof me native activity at 
^nc^c^n^donotc^u-xiesideetTe^mti-suhJec. Agists or antics 

that compel for binding to tne native poiypeptide can require conations — 
^ than the nrfve concentre., v*ile inhibitors c^le of binding nxeverstbly ti, the 

The end results of such serening and experimentation will be « leas, one novel 
polyene binding partner, such as a receptor, encoded by a gene or a cDNA 

antagonist of the novel Wnding parser. Such agonist and antagonist can be used to 

ceUsthat possess the recepti>r as a result of genetic engineering. Further, if the novel 
reoepu-r snares biological!, important charac«ristics with a known reoepttr, utformation 
about agonist/antagonist binding can facilitate development of unproved 
agonisrs/antagonists of the known receptor. 
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H nh^rei.tical r- ;ti "" s and Therapeutic Use s 
Pharmaceutical compositions can comprise polypeptides, antibodies, or 
poiynucleotidesoftheclaimedinvention. T*e pharmaceutical compositions i~ 
athTrapeutically effective amount of either polypeptides, antibodies, or polynucleo^ 

the claimed invention. 

The term •*«ra^utica..y effective amount" as used herein refers to an amount of. 

^^.e^uucorp^nuuive^^e^canbedc^by.for^, 
^ markers or .eve>, Therapeutic etTec* ateo include reduction » phystca. 

such as deceased body temperature. The precise effective amount for a subject 

the theses or combination of therapeutics select for administiation. Thus. .. . no, 
useful to specify an exact effective amount in advance. However. *" ff -™ t f 

from about 0.0, mg/ kg to 50 mg/kg or 0.05 mg*g to about 10 mg/kg of me DNA 
constructs in the individual to which it is administered. 

A ptorrnaceutical composition c» also contain a pluumaceuticaUy acceptable 
^ Th«term"pharmace«tic^accep<^ 

Junction of antibodies harmful individual receiving me c.unposrao. and which 
can be administered without undue toxicity. Suitabie carriers can be l»ge. slowly 
^n^macromolecoles such as proteins, polysaccharides, polylactic acds. 
polyglycotic acids, polymeric amino acids, amino acid copolymers, and inactive vtrus 
particles. Such carriers are well known to those of ordinary skill in the art. 

Pharmaceutic*!!, acceptitble salts can be us«l therein, for example, nnneralactd 
salts such as hydrochlorides, nydrobromides, phosphates, sulfates, and the like; and me 
salBrforgaric^idssuch^aceuues.pmpionates. malona.es. ber^andthehke. A 
morongh discussion of pharmaceuticaUy accept excipients is arable in s 
Pharmaceutical Sciences (Mack Pub. Co., NX 1991). 
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SEQ ID 


Filing 


SEQ ID 


Sequence Name 


NO: 


Date of 


NO: in 






Priority 


Priority 






Appln 


Appln 




2922 


2/24/98 


1195 


RTA00000422F.e.23.1 


2923 


2/24/98 


1 168 


RTA00000421F b.06.1 


2924 


2/24/98 


580 


RTA00000403 F.o.07. 1 


2925 


2/24/98 


531 


RTA00000345 F.n. 1 2. 1 


2926 


2/24/98 


154 


RTA00000340F.b.21.l 


2927 


2/24/98 


19 


RTA00000123A.k.23.1 


2928 


2/24/98 


1265 


RTA00000340F.d.07. 1 


2929 


2/24/98 


1124 


RTA00000 1 23 A.h.22. 1 


2930 


2/24/98 


1241 


RTA00000408FJ.14.1 


2931 


2/24/98 


534 


RTAOOOOO 126A.g.7.1 


2932 


2/24/98 


694 


RTA00000418F.C.07.I 


2933 


2/24/98 


1034 


RTAOOOOO 124A.f. 16.3 


2934 


2/24/98 


790 


RTA00000345F.n.08.1 


2935 


2/24/98 


613 


RTAOOOOO I22AJ.22.1 


2936 


2/24/98 


885 


RTAOOOOO 122A.j. 17.1 


2937 


2/24/98 


1262 


RTAOOOOO 1 22 A.h.4.1 


2938 


2/24/98 


135 


RTA00000122A.d.5.1 


2939 


1/28/98 


391 


RTA00000 1 79AF.C.20.3 


2940 


2/24/98 


537 


RTA00000408F.1.09. 1 


2941 


2/24/98 


683 


RTA00000403F.j.21.1 


2942 


2/24/98 


343 


RTA00000422F.g.21.1 


2943 


2/24/98 


226 


RTAOOOOO 125A.k. 10.1 


2944 


2/24/98 


763 


RTA00000 1 35 A.m. 1 8. 1 


2945 


2/24/98 


156 


RTA00000125A.k.l.I 


2946 


2/24/98 


597 


RTA00000135A.I.1.2 


2947 


2/24/98 


586 


RTA00O00 1 25A.g.24. 1 


2948 


2/24/98 


467 


RTAOOOOO 1 23 A.n. 1 3 2 


2949 


2/24/98 


830 


RTA00000347F.b.08. 1 


2950 


2/24/98 


997 


RTA00000134A.I.9.1 


2951 


2/24/98 


371 


RTA00000403FJ.17.1 


2952 


2/24/98 


33 


RTA00000403FJ.15.1 


2953 


2/24/98 


1209 


RTA00000408F.n .05 .2 


2954 


2/24/98 


530 


RTA00000408F.n.02.2 


2955 


2/24/98 


1213 


RTAOOOOO 1 3 5 A.a.23 . 1 


2956 


2/24/98 


347 


RTAOOOOO 1 25 A.n.4.1 


2957 


2/24/98 


472 


RTAOOOOO 135A.f. 14.2 


2958 


2/24/98 


243 


RTA00000410F.C.14.1 


2959 


2/24/98 


919 


RTA00000410F.d.l8.1 


2960 


2/24/98 


825 


RTA00000404F.k.22.2 


2960 


2/24/98 


364 


RTA00000404F.k.22. 1 


2961 


2/24/98 


825 


RTA00000404F.k.22.2 


2961 


2/24/98 


364 


RTA00000404F.k.22. 1 


2962 


2/24/98 


595 


RTA000004IOF.d.lO.I 


2963 


2/24/98 


175 


RTA 000004 1 OF.d.09. 1 


2964 


2/24/98 


206 


RTA00000410F.b.l5.1 


2965 


2/24/98 


1083 


RTA 000004 1 8F.j .20.1 


2966 


2/24/98 


922 


RTA000004 1 0F.e.09. 1 


2967 


2/24/98 


1035 


RTA00000404F.k.l5.1 


2968 


2/24/98 


1 167 


RTA00000410F.C.06.1 


2969 


2/24/98 


53 


RTA00000410F.C.04.1 



194 



Clone Name Cluster 
ID 



M0000 1 567D:B03 1 9246 

M00001567A:B09 2113 

M0000 1 579C: AO 1 3903 7 

MOOOO 1 528A:C04 7337 

MOOOO 1 533 D: A08 800 1 

M0O00 1 533 A:G05 803 1 3 

MOOOO 1532D:A06 0 

MOOOO 1 53 2 A :C0 1 1 7 1 24 

MOOOO 1 530A:E1 0 1 200 1 

MOOOO 1 548 A:H04 1 902 

MOOOO 1 529D:C05 73245 

MOOOO 1 536 A:F 11 47430 

M00001517A:B11 0 

M00001516A:F06 81 151 

M00001516A:D02 62736 

MOOOO 1 5 1 4 A:G03 33 576 

M0OO01513A:F05 81 155 

MOOOO 1 396 A:C03 4009 

M00001 530A:A09 75487 

MOOOO 1 540D:E02 24723 

MOOOO 1 583 A:F07 1 7232 

MOOOO 1 545 A:F02 8 1 644 

MOOOO 1 545 A:C03 19255 

MOOOO 1 545 A:B 12 0 

MOOOO 1 545 A:B 10 39426 

M00001 544A:F05 80397 

MOOOO 1 534A:D03 39 1 67 

MOOOO 1 54 1 B:E05 1 759 1 

M00001535A:D10 81814 

M00001539D:B10 38563 

MOOOO 1 539B:G07 23840 

MOOOO 1 539A:H02 77883 

MOOOO 1539A:E01 76993 

MO0001537A:H05 27054 

MOOOO 1 546A:D08 8 1 984 

MOOOO 1 542 A:G 1 2 79969 

MOOOO 1 634A:H05 77809 

MOOOO 1635D:D05 75458 

MOOOO 1 635D:CI 2 39084 

M 0000 1 63 5 D:C 1 2 39084 

MOOOO 1635D:C 12 39084 

MOOOO 1 635 D:C 1 2 39084 

MOOOO1635B:H02 77561 

M00001 635B:H0 1 76964 

MOOOO 1 633C:F09 77 1 00 

MOOOO 1 634D:D04 77 1 0 1 

M00001 636A:F08 76093 

M00001 634A:B04 1 8225 

M00001633D:H06 77784 

MOOOO 1 633 D:G09 74099 
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Nearest Neighbor 
(BlastN vs. Genbank) 


Nearest Neighbor 
(BlastX vs. Non-Redundant Prot ins) 


SEQ 
ID 


ACCESSION 


DESCRIPTION 


P VALUE 


ACCESSION 


DESCRIPTION 


P VALUE 


2917 


X80283 


P.polycephalum 
genomic DNA 

IsVPllulLIlIIlg 1 dlf 1 

repetitive element 


3.3 


<NONE> 


<NONE> 


<NONE> 


2918 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
iTLtvJN/-\., complete 
cds 


6e-006 


<NONE> 


<NONE> 


<NONE> 


2919 


U95094 


Xenopus laevis 
AL.-1IN C t IN r 

(XL-INCENP) 
mRNA, complete 
cds 


6e-006 


<NONE> 


<NONE> 


<NONE> 


2920 


Z97333 


Homo sapiens 
RHCE gene 


9e-020 


113667 


!!!! ALU CLASS 
B WARNING 
ENTRY !!!! 


4e-005 


2921 


AF082350 


Homo sapiens 
bone 

morphogenetic 
protein 15 
precursor 
(BMP 15) gene, 
exon 2 and 
complete cds 


1 


<NONE> 


<NONE> 


<NONE> 


2922 


LI 4684 


Rattus norvegicus 
nuclear-encoded 
mitochondrial 
elongation factor 
G mRNA, 
complete cds. 


0 


585084 


ELONGATION 
FACTOR G, 
MITOCHONDRI 

AL JrxvJti^ U KovJK 

(MEF-G) 
>gi|543383|pirl|S4 
0780 translation 
elongation factor 
G, mitochondrial - 
rat >gi|3 10102 


9e-089 


2923 


D78335 


Human mRNA 
for 5 '-terminal 
region of UMK, 
complete cds 


e-163 


1718058 


URIDINE 

(URIDINE 

MONOPHOSPHO 

KINASE) 

>gi|471981 

(L31783) uridine 

kinase 


7e-072 


2924 


U95094 


Xenopus laevis 
XL-INCENP 
(XL-INCENP) 
mRNA, complete 
cds 


2e-007 


<NONE> 


<NONE> 


<NONE> 
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We Claim: 

1. A library of polynucleotides, the library comprising the sequence information of 
at least one of SEQ ID NOS:l-3544, 3546-4510, 4512-4725, 4727-4748, and 4750-5252. 

5 2. The library of claim 1 , wherein the library is provided on a nucleic acid array. 

3. The library of claim 1, wherein the library is provided in a computer-readable 

format. 

10 4. The library of claim 1, wherein the library comprises a differentially expressed 

polynucleotide comprising a sequence selected from the group consisting of SEQ ID 
NOS:65, 174, 203, 252, 253, 387, 419, 420, 491, 552, 560, 581, 590, 648, 693, 726, 746, 
990, 1095, 1124, 1205, 1354, 1387, 1780, 1899, 1915, 1979,2007, 2024, 2245, and 2325. 

15 5. The library of claim 1 , wherein the library comprises a polynucleotide 

differentially expressed in a human breast cancer cell, where the polynucleotide comprises 
a sequence selected from the group consisting of SEQ ID NOS:15, 36, 44, 45, 89, 146, 154, 
159, 165, 174, 172, 183, 203, 261, 364, 366, 387, 419, 420, 496, 503, 510, 512, 529, 552, 
560, 564, 570, 590, 606, 644, 646, 693, 707, 71 1, 726, 746, 754, 756, 875, 902, 921, 942, 

20 990, 1095, 1104, 1122, 1131, 1142, 1170, 1184, 1205, 1286, 1289, 1354, 1387, 1435, 
1535, 1751, 1764, 1777, 1795, 1860, 1869, 1882, 1890, 1915, 1933, 1934, 1979, 1980, 
2007, 2023, 2040, 2059, 2223, 2245, 2300, 2325, 2409, 2462, 2486, 2488, and 2492. 

6. The library of claim 1, wherein the library comprises a polynucleotide 
25 differentially expressed in a human colon cancer cell, where the polynucleotide comprises 
a sequence selected from the group consisting of SEQ ID NOS:33, 65, 228, 250, 252, 253, 
280, 282, 355, 370, 387, 443, 460, 491, 545, 560, 581, 603, 680, 693, 703, 704, 716, 726, 
746, 752, 753, 1095, 1104, 1205, 1241, 1264, 1354, 1387, 1401, 1442, 1514, 1734, 1742, 
1780, 1851, 1899, 1915, 1954, 2024, 2066, 2262, and 2325. 
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7. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human lung cancer cell, where the polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NOS : 1 0, 54, 65, 1 71 , 1 74, 203, 
252, 253, 254, 285, 419, 420, 466, , 491, 525, 526, 552, 571, 574, 590, 693, 700, 726, 742, 

5 746, 861, 990, 922, 1088, 1288, 1355, 1417, 1422, 1444, 1454, 1570, 1597, 1979, 2007, 
2024, 2034, 2038, 2126, and 2245. 

8. The library of claim 1, wherein the library comprises a polynucleotide 
differentially expressed in a human cancer cell, where the polynucleotide comprises a 

1 0 sequence selected from the group consisting of SEQ ID NOS :648 and 1 899. 

9. An isolated polynucleotide comprising a nucleotide sequence having at least 
90% sequence identity to an identifying sequence of SEQ ID NOS:l-3544, 3546-4510, 
4512-4725, 4727-4748, and 4750-5252, or a degenerate variant or fragment thereof. 

10. The polynucleotide of claim 9, wherein the polynucleotide comprises a 
sequence of one of SEQ ID NOS:2503, 2504, 2550, 2555, 2578, 2656, 2667, 2712, 2723, 
2728, 2738, 2734, 2754, 2758, 2760, 2832, 2835, 2842, 2843, 2849, 2893, 2933, 2956, 
2971, 2981, 3009, 3018, 3019, 3046, 3084, 3190, 3129, 3173, 3226, 3227, 3274, 3290, 
3356, 3365, 3377, 3381, 3390, 3391, 3404, 3407, 3408, 3409, 3418, 3419, 3451, 3597, 
3600, 3618, 3632, 3635, 3646, 3648, 3657, 3665, 3669, 3670, 3671, 3656, 3680, 3686, 
3695, 3696, 3700, 3710, 3736, 3762, 3763, 3774, 3775, 3791, 3804, 3806, 3836, 3895, 
3905, 3919, 3920, 3927, 3936, 3951, 3974, 3998, 4036, 4038, 4044, 4056, 4072, 41 17, 
41 19, 4152, 4153, 4154, 4172, 4175, 4159, 4175, 4205, 4216, 4223, 4228, 4238, 4241, 
4243, 4251, 4253, 4261, 4263,4278, 4288, 4322, 4330, 4343, 4359, 4363, 4364, 4365, 
4373, 4375, 4384, 4385, 4406, 4409, 4431, 4434, 4441, 4442, 4444, 4455, 4469, 4473, 
4477, 4482, 4489, 4495, 4496, 4498, 4525, 4535, 4536, 4540, 4560, 4616, 4562, 4586, 
4605, 4629, 4653, 4654, 4658, 4659, 4660, 4661, 4664, 4665, 4668, 4684, 4682, 4688, 
4689, 4710, 4718, 4733, 4724, 4733, 4746, 4755, 4760, 4710, 4777, 4785, 4792, 4794, 
4801, 4807, 4821, 4822, 4847, 4850, 4854, 4856, 4866, 4885, 4900, 4901, 4905, 4914, 
4925, 4929, 4931, 4943, 4944, 4959, 5111, 5020, 5041, 5046, 5059, 5083, 5090, 5094, 
5102, 5125, 5174, 5197, 5208, 5217, 5237, 5239, 5241, 5243, 5248, and 5252. 
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11. A recombinant host cell containing the polynucleotide of claim 9. 

12. An isolated polypeptide encoded by the polynucleotide of claim 9. 
5 1 3. An antibody that specifically binds a polypeptide of claim 1 2. 

14. A vector comprising the polynucleotide of claim 9. 

1 5. A polynucleotide comprising the nucleotide sequence of an insert contained in 
10 a clone deposited as ATCC accession number xx, xx, xx, xx, xx, xx, xx, xx, or xx. 

16. A method of detecting differentially expressed genes correlated with a 
cancerous state of a mammalian cell, the method comprising the step of: 

detecting at least one differentially expressed gene product in a test sample derived 
1 5 from a cell suspected of being cancerous, where the gene product is encoded by a gene 

corresponding to a sequence of at least one of SEQ ID NOSrlO, 15, 33, 36, 44, 45, 54, 65, 
89, 146, 154, 159, 165, 171, 172, 174, 183, 203, 228, 250, 252, 253, 254, 261, 280, 282, 
285, 355, 364, 366, 370, 387, 419, 420, 443, 460, 466, 491, 496, 503, 510, 512, 525, 526, 
529, 545, 552, 560, 564, 570, 571, 574, 581, 590, 603, 606, 644, 646, 648, 680, 693, 700, 
20 703, 704, 707, 71 1, 716, 726, 742, 746, 752, 753, 754, 756, 861, 875, 902, 921, 922, 942, 
990, 1088, 1095, 1104, 1122, 1131, 1142, 1170, 1184, 1205, 1286, 1288, 1289, 1354, 
1355, 1387, 1417, 1435, 1444, 1454, 1535, 1570, 1597, 1734, 1742, 1751, 1764, 1777, 
1780, 1795, 1860, 1869, 1882, 1890, 1899, 1915, 1933, 1934, 1954, 1979, 1980,2007, 
2023, 2024, 2034, 2040, 2059, 2126, 2223, 2245, 2262, 2300, 2325, 2409, 2486, 2462, 
25 2488, 2492, 1241, 1264, 1401, 1422, 1442, 1514, 1851, 1915, 2007, 2024, 2038, 2066, and 
2245; 

wherein detection of the differentially expressed gene product is correlated with a 
cancerous state of the cell from which the test sample was derived. 

30 1 7. The method of claim 1 6, wherein said detecting step is by hybridization of the 

test sample to a reference array, wherein the reference array comprises an identifying 
sequence of at least one of SEQ ID NOS: 65, 174, 203, 252, 253, 387, 419, 420, 491, 552, 
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560, 581, 590, 648, 693, 726, 746, 990, 1095, 1 124, 1205, 1354, 1387, 1780, 1899, 1915, 
1979, 2007, 2024, 2325, and 2245. 

18. The method of claim 16, wherein the cell is a breast tissue derived cell, and the 
5 differentially expressed gene product is encoded by a gene corresponding to a sequence of 

at least one of SEQ ID NOS:36, 44, 45, 89, 146, 154, 159, 165, 172, 174, 183,203,261, 
364, 366, 387, 419, 420, 496, 503, 510, 512, 529, 552, 560,564, 570, 590, 606, 644, 646, 
693, 707, 71 1, 726, 746, 754, 756, 875, 902, 921, 942, 990, 1095, 1 104, 1 122, 1131,1 142, 
1170, 1184, 1205, 1286, 1289, 1354, 1387, 1435, 1535, 1751, 1764, 1777, 1795, 1860, 
10 1869, 1882, 1890, 1915, 1933, 1934, 1979, 1980, 2007, 2023, 2040, 2059, 2223, 2245, 
2300, 2325, 2409, 2462, 2486, 2488, and 2492. 

19. The method of claim 16, wherein the cell is a colon tissue derived cell, and the 
differentially expressed gene product is encoded by a gene corresponding to a sequence of 

15 at least one of SEQ ID NOS:33, 65, 228, 250, 252, 253, 280, 282, 355, 370, 387, 443, 460, 
491, 545, 560, 581, 603, 680, 693, 703, 704, 716, 726, 746, 752, 753, 1095, 1 104, 1205, 
1241, 1264, 1354, 1387, 1401, 1442, 1514, 1734, 1742, 1780, 1851, 1899, 1915, 1954, 
2024, 2066, 2262, and 2325. 

20 20. The method of claim 1 6, wherein the cell is a lung tissue derived cell, and the 

differentially expressed gene product is encoded by a gene corresponding to a sequence of 
at least one of SEQ ID NOS: 10, 54, 65, 171, 174, 203, 252, 253, 254, 285, 419, 420, 466, 
491, 525, 526, 552, 571, 574, 590, 693, 700, 726, 742, 746, 861, 922, 990, 1088, 1288, 
1355, 1417, 1422, 1444, 1454, 1570, 1597, 1979, 2007, 2024, 2034, 2038, 2126, and 2245. 
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21 . The method of claim 16, wherein the differentially expressed gene product is 
encoded by a gene corresponding to a sequence of at least one of SEQ ID NOS:648 and 
1899. 
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<223> ri = A, T, C or G 



<400> 2921 

gttactcctc tnanatcagc tacttganga tccctcgatt ngaattcngc acgaggcgat 60 

ttatttnaca gagttaaggg gccagtacac fctnatggtat aaaattatct ttntcagggg 120 

atgaaggcac aaggagaaaa ttacttgaag cfctggagafcc ttctctggca agcaatttac 180 

aaafctctggt gttcttngat ctggctcccn gcccagacaa ccanggagtt ntfcnatgfctc 240 

tatcctcatg tgnnannact atacgcaata attngncntn ngccatanag gagggatccg 300 

atanncgaca tngctntccn ncanatatac tncncntgna atgnnnctna taatgcatnn 360 

nntnnattcc tntctaggnt acncnnantt atatntnntn ggnaactcat ttaacancaa 42 O 

nttcacngca ttcccnfcggg gttacatata cnctnaagac tatgctgana ctgtgcacca 480 

tgnctacatn ngggaattgg atggggtgct tnacggactn ccttgnatgc aagnacttac 540 

cagacgtttc canccaanct gacattgntg naatgcatta cncacntggt gntncaantt 600 

tactacacct cganaggacc gttcacnggn atttaacctn tcaaanatng ttcnnanggt 66 O 

tacaaggtcc ccaattgtnn ganccttggg gctttgncaa cn 702 



<210> 2922 
<211> 708 
<2±2> DNA 

<213> Homo sapiens 
<220> 

<22l> misc_f eature 
<222> (1) . . . (708) 
<223> n = A,T,C or G 



<400> 2922 

anaccnttta nnctngttct ttttgcagga tcccatcgat tcgaattcgg cacgaggtat 60 

actttgacac tgagaacaaa gagacagtta fcatctggaat gggagaatta cacctggaaa 120 

tctatgctca gaggctggaa agagagtatg gctgtcctfcg tatcacagga aagccaaaag 18 O 

ttgcctttcg agagaccatt actgcccctg tcccgtttga ctttacacat aaaaaacaat 240 

caggtggfcgc aggccagtat ggaaaagtaa taggtgtcct ggagcctctg gacccagagg 300 

actacactaa attggaattt tcagatgaaa cattcggatc aaatattcca aagcagtttg 3 60 

tgcctgctgt agaaaagggg tttttagatg cctgcgagaa gggccctctt tctggtcaca 420 

agctictctgg gctccggttt gtcctgcaag atggagcaca ccacatggtt gattctaatg 480 

aaatctcttt catccgagca ggagaaggtg ctcttaaaca agccttggca aatgcaacat 540 

tatgtattct tgaacctatt atggcfcgtgg aagttgtagc tccaaatgaa tttcagggac 600 

aagtaattgc aggaattaac cgacgccatg gggtaatcac tgggcaagat ggagttgagg 660 

actattttac actgtatgca gatgtccctc taaatgatat gttgggnt 708 

<210> 2923 

<211> 715 

<212> DNA 

<213> Homo sapiens 



<220> 

< 2 2 1 > misc_f eature 

<222> (1) (715) 

<223> n = A, T, C or G 



<400> 2923 

gimnnnttct aatgcnnggc tnttntgcag gatcccatcg attcgctccc attcccggaa 60 

ggaggagaca gttactgtct atcccgcaga cgtggtgctc tttgaaggga tccfcggggca 120 

gaatgaggtg gactatcgcc agaagcaggt ggtcatcctg agccaggata gcttctaccg 180 

tgtccttacc tcggagcaga aggccaaagc cctgaagggc cagttcaact ttgaccaccc 240 

ggatgccttt gacaatgaac tcattctcaa aacactcaaa gaaatcactg aagggaaaac 300 
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agtccagatc cccgtgtatg actttgtctc ccattcccag gaggtacgag acctgttcca 360 

gatgaagctt fcttgtggata cagatgcgga cacccggctc tcacgcagag tattaaggga 42 0 

catcagcgag agaggcaggg atcttgagca gattttatct cagtacatta cgttcgtcaa 480 

gcctgccttt gaggaatfcct gcttgccaac aaagaagtat gctgatgtga tcatccctag 540 

aggtgcagat aatctggtgg ccatcaacct catcgtgcag cacatccagg acatcctgaa 600 

tggagggccc ttcaaacggc agaccaatgg ctgtctcaac ggctacaccc cttcacgcaa 660 

gangcangca tcggagtnca gcagcaggcc gcattgaccc gtcttcatcg gaccc 715 

<2X0> 2924 
<211> 724 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> tnisc_feature 
<222> (1) . . . (724) 
<223> n = A, T, C or 6 

<400> 2924 

gggnctttan atctataggn tacaggctac fctgfctcfcttt tgcaggatcc catccgatgc 60 

gcaagtaaga aaacatggcg gctatccttc tctcacatcg aaaaggaaat tttgaacaat 120 

catggaaaat cfcnggncgtg ctgngaaaac anagaagaga aatgttgcag gaaagattgt 180 

ttaanactaa tgaaatacct tttagaacag ctganagaaa ggtttaacng acaaaaanca 240 

tctggataaa tnntcttctt atcatgtgaa aactgccttc tttnacntat gtncccagna 300 

ccctcaanac agfccagtgng accanacnga nctggncctn tgctttgaria actggatgac 360 

attcttgntn nattgcctna ggtcagatnn acttgagaat tagttcatcc nnncttcaat 420 

ctatcctctt gcagaattnt ttgacatnta cntcagcaat ntttgctnta ncanagnccn 480 

afcgtaggata tctatgacct nncanngttt gafcgantncn tgcnnctgna tnnnncgaga 540 

gatntcctaa cna tnncann nnntaanttc tggtantgct caacagattg gaaaaagggg 600 

ccaganctgt gnctnaangg ttaaaancnc aggannagta ttttncgtaa acatgnaaan 660 

gnttangact gttcatnnnt tgntcctccg aaantgggca cccnttntta ttnattccnc 720 

tgcg 724 

<210> 2925 
<211> 748 
<212> DNA 

<213> Homo sapiens 
<220> 

<22l> misc_feature 

<222> (1) (748) 

<223> n = A,T, C or G 

<400> 2925 

ggtttanttt aaatccntnc ncagctactt gttctttttg caggatccca tcgattcgaa 60 

ttcggcacga gcggacccat cggagcgtaa cctggatctc cgcaggcctg gcggaggccg 120 

gccacctgga ggggcattgc ttggttcgcg tggtancaga ggagctitgag aatgttcgca 180 

tcttaccaca tacagttctt tacatggctg attcagaaac tttcattagt ctggaagagt 24 O 

gtcgtggcca taagagagca aggaaaagaa ctagtatgga aacagcactt gcccttgaga 300 

agctattccc caaacaatgc caagtccttg ggattgtgac cccaggaatt gtagtgactc 360 

caatgggatc angtagcaat cgacctcagg aaatagaaat tggagaatct ggttttgctt 420 

tattattccc ttcaaattga aggaataaaa atncaaccct ttcattttat taaggatcca 480 

aagaatttaa cattagaaag acatnaactt actgaagtag gtcttttaga taccctgaac 540 

ttcgtgtggt cttgnctttg gttataattg ctgtaaggtg ggagccagta attatctgca 600 

gcaagtagtc acncttttca gtgatatgaa tatcatcttt ggcttggang ccantngaca 660 

acctgncatt actgactttt tgaaaanaac cctctggata ttgatgcctc gggtgtggtt 720 
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